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Abstract 
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Consider a dynamical system given by a planar differential equation, which exhibits 
an unstable periodic orbit surrounding a stable periodic orbit. It is known that under 
random perturbations, the distribution of locations where the system's first exit from 
the interior of the unstable orbit occurs, typically displays the phenomenon of cycling: 
. The distribution of first-exit locations is translated along the unstable periodic orbit 

Qh ' proportionally to the logarithm of the noise intensity as the noise intensity goes to 

^ (— I zero. We show that for a large class of such systems, the cycling profile is given, 

up to a model-dependent change of coordinates, by a universal function given by a 
periodicised Gumbel distribution. Our techniques combine action-functional or large- 
deviation results with properties of random Poincare maps described by continuous- 
space discrete-time Markov chains. 
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1 Introduction 

Many interesting effects of noise on deterministic dynamical systems can be expressed as a 
stochastic exit problem. Given a subset V of phase space, usually assumed to be positively 
invariant under the deterministic flow, the stochastic exit problem consists in determining 
when and where the noise causes solutions to leave T>. 

If the deterministic flow points inward T> on the boundary dT>, then the theory of 
large deviations provides useful answers to the exit problem in the limit of small noise 
intensity |FW98| . Typically, the exit locations are concentrated in one or several points, 
in which the so-called quasipotential is minimal. The mean exit time is exponentially long 
as a function of the noise intensity, and the distribution of exit times is asymptotically 
exponential |Day83|. 

The situation is more complicated when dT>, or some part of it, is invariant under the 
deterministic flow. Then the theory of large deviations does not suffice to characterise 
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the distribution of exit locations. An important particular case is the one of a two- 
dimensional deterministic ordinary differential equation (ODE), admitting an unstable 
periodic orbit. Let T> be the part of the plane inside the periodic orbit. Day |Day90a 



Day90b| discovered a striking phenomenon called cycling: As the noise intensity a goes to 



zero, the exit distribution rotates around the boundary dV, by an angle proportional to 
| log a\ . Thus the exit distribution does not converge as a — > 0. The phenomenon of cycling 
has been further analysed in several works by Day |Day92 Day94 Day96| , by Maier and 
Stein [MS96llMS97j . and by Getfert and Reimann |GR091 [GRIP] . 

The noise-induced exit through an unstable periodic orbit has many important appli- 
cations. For instance, in synchronisation it determines the distribution of noise-induced 
phase slips [PRK01]. The first-exit distribution also determines the residence-time distri- 
bution in stochastic resonance [GHJM98J IMSOll lBG05j. In neuroscience, the interspike 
interval statistics of spiking neurons is described by a stochastic exit problem |Tuc75[ 
ITuc891 lBG09t IBL1 2] . In certain cases, as for the Morris-Lecar model |ML81j for a region 
of parameter values, the spiking mechanism involves the passage through an unstable pe- 
riodic orbit (see, e.g. [RE891 ITP041 lTKY+061 IDG12Q . In all these cases, it is important 
to know the distribution of first-exit locations as precisely as possible. 

In |BG04| . we introduced a simplified model, consisting of two linearised systems 
patched together by a switching mechanism, for which we obtained an explicit expres- 
sion for the exit distribution. In appropriate coordinates, the distribution has the form of 
a periodicised Gumbel distribution, which is common in extreme-value theory. Note that 
the standard Gumbel distribution also occurs in the description of reaction paths for over- 
damped Langevin dynamics [CGLM12]. The aim of the present work is to generalise the 
results of [BG04] to a larger class of more realistic systems. Two important ingredients of 
the analysis are large-deviation estimates near the unstable periodic orbit, and the theory 
of continuous-space Markov chains describing random Poincare maps. 

The remainder of this paper is organised as follows. In Section [21 we define the system 
under study, discuss the heuristics of its behaviour, and state the main result (Theo- 
rem [2]3]). Subsequent sections are devoted to the proof of this result. Section [3] describes 
a coordinate transformation to polar-type coordinates used throughout the analysis. Sec- 
tion U] contains the large-deviation estimates for the dynamics near the unstable orbit. 
Section [5] states results on Markov chains and random Poincare maps, while Section [6] 
contains estimates on the sample-path behaviour needed to apply the results on Markov 
chains. Finally, in Section [7] we complete the proof of Theorem [ 



2 Results 

2.1 Stochastic differential equations with an unstable periodic orbit 

Consider the two-dimensional deterministic ODE 

i = f(z) , (2.1) 

where / E C 2 (T>o,W 2 ) for some open, connected set T>q Cl 2 . We assume that this system 
admits two distinct periodic orbits, that is, there are periodic functions 7± : M — > 1R 2 , of 
respective periods T±, such that 

7±(t) = /(7±(*)) Vt € K . (2.2) 
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Figure 1. Geometry of the periodic orbits. The stable orbit 7_ is located inside the 
unstable orbit 7+. T±(<p) denote parametrisations of the orbits, and u±(ip) are eigenvectors 
of the monodromy matrix used to construct a set of polar-type coordinates. 



We set r±(yj) = 7±(T±y>), so that tp G S 1 = R/Z gives an equal-time parametrisation of 
the orbits. Indeed, 

^r ± (^) = T ± /(r ± M), (2.3) 

and thus p = 1/T± is constant on the periodic orbits. 

Concerning the geometry, we will assume that the orbit T_ is contained in the interior 
of r + , and that the annulus-shaped region S between the two orbits contains no invariant 
proper subset. This implies in particular that the orbit through any point in S approaches 
one of the orbits T± as t — > 00 and t — > — 00. 

Let A±(tp) = d z f(T±(ip)) denote the Jacobian matrices of / at T±(p>). The principal 
solutions associated with the linearisation around the periodic orbits are defined by 

dyUifaw) = T±A±(<p)U±(<p,<po) , U±(<p ,<p ) = 1 • (2.4) 

In particular, the monodromy matrices U±{y> + 1, (p) satisfy 

detU ± (ip + l,ip) = expjT ± J V+ TvA ± {p') d^'j , (2.5) 

with TrA±(ip') = div f(T±((p')). Taking the derivative of (|2.3h shows that each mon- 
odromy matrix U±(ip + 1, <p) admits f(T±((p)) as eigenvector with eigenvalue 1. The other 
eigenvalue is thus also independent of <p, and we denote it e ±A±T± , where 

±A±= C div f(T±(<p)) dp (2.6) 
J 

are the Lyapunov exponents of the orbits. We assume that A + and A_ are both positive, 
which implies that T_ is stable and T + is unstable. 

Consider now the stochastic differential equation (SDE) 

dzt = f(z t ) dt + ag(z t ) dW t , (2.7) 

where / satisfies the same assumptions as before, {Wi}t is a A;-dimensional standard Brow- 
nian motion, k ^ 2, and g E C 1 (T>o,M> 2xk ) satisfies the uniform ellipticity condition 

ci\\f\\^ (Z,g{z)g(zft) ^c 2 \\e\\ VzGPoVeeM 2 (2.8) 

with C2 ^ ci > 0. 
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Proposition 2.1 (Polar-type coordinates). There exist L > 1 and a set of coordinates 
(r,(p) G (—L,L) xl, m which the SDE (j2.7|) ta/ces £/ie /orm 

dr t = / r (r t) v? t ; cr) di + crft-fo, </? t ) dW t , 

d<£t = /^(n, <ff, cr) dt + ag v (r t , ipt) dW t . (2.9) 

XTie functions f r , f v ,g r and g v are periodic with period 1 in (p, and g r ,gtn satisfy a uniform 
ellipticity condition similar to (|2.8p . The unstable orbit lies in r = l + 0(a 2 ), and 

/ r (r,^) = A + (r-l) + 0((r-l) 2 ), 

U(r,ip) = ± + 0((r-lf) (2.10) 

as r — > 1. T/ie stable orbit lies in r = — 1 + 0(a 2 ), and 

f r (r,<p) = -A_(r + l) + 0((r + l) 2 ) , 

^(r^) = ^ + 0((r + l) 2 ) (2.11) 

as r — )• —1. Furthermore, /„ is strictly larger than a positive constant for all (r,ip) G 
( — L, L) xl, and / r zs negative for — 1 < r < 1. 

We give the proof in Section We emphasize that after performing this change of 
coordinates, the stable and unstable orbit are not located exactly in r = ±1, but are 
slightly shifted by an amount of order a 2 , owing to second-order terms in Ito's formula. 

The question we are interested in is the following: Assume the system starts with 
some initial condition (ro,y>o) = ( r 0;0) close to the stable periodic orbit. What is the 
distribution of the first-hitting location of the unstable orbit? We define the first-hitting 
time of (a ^-neighbourhood of) the unstable orbit by 

r = M{t > 0: T t = 1} , (2.12) 

so that the random variable ip T gives the first-exit location. Note that we consider ip as 
belonging to R + instead of the circle R /Z , which means that we keep track of the number 
of rotations around the periodic orbits. 



2.2 Heuristics 1: Large deviations 

A first key ingredient to the understanding of the distribution of exit locations is the 
theory of large deviations, which has been developed in the context of SDEs by Freidlin 
and Wentzell |FW98j . The theory tells us that for a set V of paths 7 : [0,T] -> R 2 , one 
has 

-infl^ liminf o- 2 logP{(z t ) te [ T ] G T] < limsupu 2 logP{(z 4 ) tg [ T ] G T} < -mil , 

(2.13) 

where the rate function I = /[o,t] : C°([0,T],R 2 ) — > R + is given by 

/(7) \\ f\is ~ fils^D^T 1 ^ - /( 7s )) ds if 7 G H 1 , (214) 
^ +00 otherwise, 
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with D(z) = g(z)g(z) T (the diffusion matrix). Roughly speaking, Equation (|2.13p tells us 
that 



F{ (zt)te[o,T] e^e- infr ^ or, symbolically, P{ (z t ) te[m = 7} ~ . 

(2.15) 

For deterministic solutions, we have 7 = /(j) and 1(7) = 0, so that (I2.15h does not yield 
useful information. However, for paths 7 with 1(7) > 0, (|2.15p tells us how unlikely 7 is. 

The minimisers of I obey Euler-Lagrange equations, which are equivalent to Hamilton 
equations generated by the Hamiltonian 

H(l, VO = ^ T £(7)V> + /(7) T V> , (2.16) 

where ip = -D(7) _1 (7 — /(7)) is the moment conjugated to 7. The rate function thus takes 
the form 

= \ I ^D{ ls )^ s ds . (2.17) 



2 . . 

Writing ip T = (p r ,P<p), the Hamilton equations associated with (|2.16p read 

r = f r (r, if) + D rr (r, <p)p r + D rtp (r, (p)p v , 
<f = Ui r > V 5 ) + D rv (r, ip)p r + D w (r, <p)p v , 

Pr = -d r f r (r, <p)p r - d r f<p(r, tp)p v - - d rDij(r, ip)piPj , (2.I8) 

Pep = -dtpfrir, cp)p r - dtpf^r, <p)p v - - ^ dy, Aj(r, p)piPj , 

*ie{r,y} 

We can immediately note the following points: 

• the plane p r = p v = is invariant, it corresponds to the deterministic dynamics; 

• there are two periodic orbits, given by p r = p v = and r = ±1, which are, of course, 
the original periodic orbits of the deterministic system; 

• (p is positive, bounded away from zero, in a neighbourhood of the deterministic man- 
ifold. 

The Hamiltonian being a constant of the motion, the four-dimensional phase space is 
foliated in three-dimensional invariant manifolds, which can be labelled by the value of 
H. Since d Pr H = r is positive near the deterministic manifold, one can express p v as a 
function of H, r, ip and p r , and thus describe the dynamics on each invariant manifold by 
an effective three-dimensional equation for (r,(p,p r ). It is furthermore possible to use ip 
as new time, which yields a two-dimensional, non-autonomous equation!!! 

The linearisation of the system around the periodic orbits is given by 

The characteristic exponents of the periodic orbit in r = 1 are thus ±T + A+, and those of 
the periodic orbit in r = —1 are ±T_A_. The Poincare section at <p = will thus have 
hyperbolic fixed points at (r,p r ) = (±1,0). 



1 The associated Hamiltonian is the function P v (r,p r , H,tp) obtained by expressing p v as a function of 
the other variables. 
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FIGURE 2. Poincare section of the Hamiltonian flow associated with the large-deviation 
rate function. The stable periodic orbit is located in (— 1, 0), the unstable one in (1, 0). We 
assume that the unstable manifold W" of (—1, 0) intersects the stable manifold of (1, 0) 
transversally. The intersections of both manifolds define a heteroclinic orbit {z^}-oo<fc<oo 
which corresponds to the minimiser of the rate function. 



Consider now the event T that the stochastic system, starting on the stable orbit at 
<p = 0, hits the unstable orbit for the first time near ip = s. The probability of T will be 
determined by the infimum of the rate function / over all paths connecting (r, ip) = (—1,0) 
to (r,ip) = (l,s). Note however that if t > s, we can connect (1, s) to (l 3 i) for free in 
terms of the rate function / by following the deterministic dynamics along the unstable 
orbit. We conclude that on the level of large deviations, all exit points on the unstable 
orbit are equally likely. 

This does not mean, however, that all paths connecting the stable and unstable orbits 
are optimal. In fact, it turns out that the infimum of the rate function is reached on a 
heteroclinic orbit connecting the orbits in infinite time. It is possible to connect the orbits 
in finite time, at the cost of increasing the rate function. In what follows, we will make 
the following simplifying assumption. 

Assumption 2.2. In the Poincare section for H = 0, the unstable manifold of (—1,0) 
intersects the stable manifold of (1,0) transversally (Figure [2]). Let denote the 
heteroclinic orbit meeting the Poincare section at the set {^}-oo<fc<oo of intersections of 
the manifolds. Then minimises the rate function over all paths connecting the two 
periodic orbits, and this minimiser is unique (up to translations (p 1— > ip + 1). 

This assumption obviously fails to hold if the system is perfectly rotation symmetric, 
because then the two manifolds do not intersect transversally but are in fact identical. 
The assumption is likely to be true generically for small-amplitude perturbations of <p- 
independent systems (cf. Melnikov's method), for large periods T± (adiabatic limit) and 
for small periods (averaging regime), but may not hold in general. See in particular [GT841 
IGT851 [MS97J for discussions of possible complications. 

It will turn out in our analysis that the probability of crossing the unstable orbit near 
a sufficiently large finite value of (p will be determined by a finite number n = \_<p\ of 
translates of the minimising orbit. 
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Figure 3. The optimal path minimising the rate function, and its translates. Wc 
define a random Poincare map, giving the location R\ of the first crossing of the line 
ip = 1 of a path starting in r — Rq and tp = 0. 



2.3 Heuristics 2: Random Poincare maps 

The second key ingredient of our analysis are Markov chains describing Poincare maps of 
the stochastic system. Choose an initial condition (Ro, 0), and consider the value -Ri = r Tl 
of r at the time 

ri =inf{t > 0: (pt = 1} , (2.20) 

when the sample path first reaches ip = 1 (Figure [3]). Since we are interested in the 
first-passage time through the unstable orbit, we declare that whenever the sample path 
(rt, ipt) reaches r = 1 before <p = 1, then R\ has reached a cemetery state d, which it never 
leaves again. Successively, we define R n = r Tn , where r n = inf{t >0:^ = n}, n£N. 

By periodicity of the system in ip and the strong Markov property, the sequence 
(i?o, R\, ■ ■ ■ ) forms a Markov chain, with kernel K, that is, for a Borel set A, 

F Rn {R n+1 e A} = K{R n ,A) = / K(R n ,dy) for all n ^ , (2.21) 

J A 

where F x {R n S ^4} denotes the probability that the Markov chain, starting in x, is in A 
at time n. 

Results on harmonic measures [BAKS84] imply that K(x, dy) actually has a density 
k(x,y) with respect to Lebesgue measure (see also |Dah77ll"JK82llCZ87| for related results). 
Thus the density of R n evolves according to an integral operator with kernel k. Such 
operators have been studied, among others, by Fredholm |Fre03j . Jentzsch |Jenl2j and 
BirkhofT [Bir57]. In particular, we know that k has a discrete set of eigenvalues Ao, Ai, . . . 
of finite multiplicity, where Ao is simple, real, positive, and larger than the modules of all 
other eigenvalues. It is called the principal eigenvalue of the Markov chain. In our case, 
we have Ao < 1 due to the killing at the unstable orbit. 

Fredholm theory yields a decompositiord 

k(x,y) = X h (x)h*(y) + X 1 h 1 (x)h* 1 (y) + ... (2.22) 
2 If Ai has multiplicity m > 1, the second term in (|2.22[) has to be replaced by a sum with m terms. 
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where the hi and h* are right and left orthonormal eigenfunctions of the integral operator. 
It follows that 



tRo 



{R n G A] =: K n (R , A) = \%h (Ro) I h* (y) dy 

J A 



(2.23) 



Thus the spectral gap Aq — | Ai | plays an important role in the convergence of the distribution 
of R n . For times n satisfying n 3> (log(Ao/|Ai|)) _1 , the distribution of R n will have a 
density proportional to h^. More precisely, if 

*o(da) = (2.24) 

J Kw d y 

is the so-called quasistationary distribution, then the asymptotic distribution of the process 
R n , conditioned on survival, will be ttq, while the survival probability decays like Aq. 

Furthermore, the (sub-)probability density of the first-exit location ip T at n + s, with 
n G N and s G [0, 1), can be written as 



K n (R ,dy)Fy{ip T G ds} = \%h (R ) / G ds} dy 



1+ CP||i^l 



o 

(2.25) 

This shows that the distribution of the exit location is asymptotically equal to a period- 
ically modulated exponential distribution. Note that the integral appearing in (|2.25|) is 
proportional to the expectation of tp T when starting in the quasistationary distribution. 

In order to combine the ideas based on Markov chains and on large deviations, we will 
rely on the approach first used in [BG04], and decompose the dynamics into two subchains, 
the first one representing the dynamics away from the unstable orbit, and the second one 
representing the dynamics near the unstable orbit. We consider: 

1. A chain for the process killed upon reaching, at time r_, a level 1 — 5 below the 
unstable periodic orbit. We denote its kernel K s . By Assumption 12.21 the first-hitting 
location ip T _ will be concentrated near places s* + n where a translate 7 DO ( > + n) of 
the minimiser crosses the level 1 — 5. We will establish a spectral-gap estimate 
for K s (see Theorem I6.12p . showing that ip T _ indeed follows a periodically modulated 
exponential of the form 

P°{¥V_ G [tputpx +A]} ~ (A^ 1 e~ J ^y° 2 , (2.26) 

where J is periodic and minimal in points of the form s* + n. 

2. A chain for the process killed upon reaching either the unstable periodic orbit at r = 1, 
or a level 1 — 25, with kernel K u . We show in Theorem 16 . 71 that its principal eigenvalue 
is of the form 

= e - 2X + T +(l + 0{5)) . (2.27) 

Together with a large-deviation estimate, this yields a rather precise description of the 
distribution of <p T , given the value of ip T _ , of the form 

F^-{(p T G + A]} ~e- 2A + T + ( ^-^ ) exp{-^[/ 00 + 0(e- 2A+T +^-^-))]} , 

(2.28) 

where 1^ is again related to the rate function, and the term 0(e~ 2X+T+ ^ ip ~ LpT -' > ) can 
be computed explicitly to leading order. The double-exponential dependence of ()2.28|) 
on 2A + T + ((/9 — ip T _) is in fact what characterises the Gumbel distribution. 
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Q\+T+{x) 



A+T+ = 1 




FIGURE 4. The cycling profile x i— > Q\ + t + (x) for different values of the parameter A+X+, 
shown for x € [0, 3]. 

By combining the two above steps, we obtain that the first-exit distribution is given by 
a sum of shifted Gumbel distributions, in which each term is associated with a translate 
of the optimal path . 



2.4 Main result: Cycling 

In order to formulate the main result, we introduce the notation 



h^(f) 



e -2X+T+u n n u ) du 



(2.29) 



1 _ e -2X+T + 

for the periodic solution of the equation dh/ df = 2\ + T + h — D rr (l,ip), where 

D rr (l,<p)=g r (l,<p)g r (l,cp) T (2.30) 

measures the strength of diffusion in the direction orthogonal to the periodic orbit. 

Theorem 2.3 (Main result). For any sufficiently small 5, A > ; there exists a ctq > 
such that the following holds: For any ro sufficiently close to —1 and a < ctq, 

\\oga\-0(<p) + O(8) 



™°{t E If, f + A]} = C (o-)(\oVxa(<p)Q\^ 



X4-T4. 



l + 0(V c ^ /|logCT| ) +0(<5|log(5|) , (2.31) 

where c > and we use the following notations: 
• Qx+T+ ( x ) is periodic with period 1 and given be the periodicised Gumbel distribution 



Qx +T+ (x)= J2 A{\ + T + {n-x)) 



where 



A(x) = - exp{-2a; - - e 2x } . 

(2.32) 



9(f) satisfies 9(f + 1) = 9(f) + A + T + and is given by 



9(f) = X+T+f - - log 



1 §2 hP™(f) 



2" hP° r (s*) 2 

where s* denotes the value of f where the optimal path 7^ crosses the level 1 — 5 



(2.33) 
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• Ao is the principal eigenvalue of the Markov chain, and satisfies 

A = 1 - e~ H/a2 (2.34) 

where H > is close to the value of the rate function /(7oo)- 

• X&i'-f) is periodic in <p with period 1. It is proportional to F(ip + A) — F((p) for a 
probability distribution function F. 

• The normalising constant Co(a) is of order e~ H / u . 

The proof is given in Section [7J We now comment the different terms in the expres- 
sion (|2.3ip in more detail. 

• Cycling profile: The function Q\ + t + is the announced universal cycling profile. Re- 
lation (|2.31|) shows that the profile is translated along the unstable orbit proportionally 
to | log a | . The intuition is that this is the time needed for the optimal path to 
reach a a-neighbourhood of the unstable orbit where escape becomes likely. For small 
values of A+T+, the cycling profile is rather flat, while it becomes more and more 
sharply peaked as A + T + increases (Figured]). 

• Periodic prefactor: The periodic prefactor xa(^) is related to the subexponential 
behaviour of the first-exit distribution when starting near the unstable periodic orbit. 
Its expression contains a factor F(tp + A) — F(ip) = F n o {tp T g [<^ ; ip + A]}, where the 
initial distribution ttq is the quasistationary distribution near the unstable periodic 
orbit. In the linear case with switching [BG04J, it can be computed explicitly by using 
the reflection principle, with the result 

XA(fp) « = ^L^ A for small A . (2.35) 

We conjecture that this is still the case in the nonlinear situation (up to a multiplicative 
error of the form 1 + 0(8)). This would imply that 0(ip) is a natural parametrisation 
of the unstable orbit, in which the first-exit distribution is given by the cycling profile. 

• Principal eigenvalue: The principal eigenvalue Ao determines the slow exponential 
decay of the first-exit distribution. Writing (\o) v = e _v> l losA °l, we see that the expected 
first-exit location is of order 1 / 1 log Ao | — e H ^ a . This time plays the same role as 
Kramers's time for gradient systems (see |Eyr35[ IKra40j and e.g. [Berll] for a recent 
review of Kramers' law). One may obtain sharper bounds on Ao using, for instance, 
the Donsker-Varadhan inequality (DV76j . 

• Normalisation: The prefactor Co(o~) can be estimated using the fact that the first- 
exit distribution is normalised to 1. It is of the order | log Ao | — &~ H ' a ■ 

• Transient behaviour: The error term 0(e~ cip ^ loscT ^) describes the transient be- 
haviour when not starting in the quasistationary distribution. If the initial condition 
is concentrated near the stable periodic orbit, we expect the first-exit distribution to 
be bounded above by the leading term in (I2.3ip during the transient phase. 

• Dependence on a level 5: While the left-hand side of ()2.31j) does not depend on S 
and one would like to take the limit 5 — > on the right-hand side, this would require 
also to pass to the limit a — > since the maximal value ao depends on 5 (as it does 
depend on A). 

We conclude with some remarks on applications and possible improvements and ex- 
tensions of Theorem 12.31 
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• Spectral decomposition: In the proof presented here, we rely partly on large- 
deviation estimates, and partly on spectral properties of random Poincare maps. 
By obtaining more precise information on the eigenfunctions and eigenvalues of the 
Markov chain K u , one might be able to obtain the same result without using large 
deviations. This is the case for the linearised system (see Proposition 16. ip . for which 
one can check that the eigenfunctions are similar to those of the quantum harmonic 
oscillator (Gaussians multiplied by Hermite polynomials). 

• Residence-time distribution: Consider the situation where there is a stable peri- 
odic orbit surrounding the unstable one. Then sample paths of the system switch back 
and forth between the two stable orbits, in a way strongly influenced by noise intensity 
and period of the orbits. The residence-time distribution near each orbit is related to 
the above first-exit distribution jBG05| . and has applications in the quantification of 
the phenomenon of stochastic resonance. 

• More general geometries: In a similar spirit, one may ask what happens if the 
stable periodic orbit is replaced by a stable equilibrium point, or some other attractor. 
We expect the result to be similar in such a situation, because the presence of the 
periodic orbit is only felt inasmuch hitting points of the level 1 — 5 are concentrated 
within each period. 

3 Coordinate systems 
3.1 Deterministic system 

We start by constructing polar-like coordinates for the deterministic ODE (|2. 1|) . 



Proposition 3.1. There is an open subset T>\ = (-L, L) x S 1 of the cylinder, with L > 1, 
and a C 2 -diffeomorphism h:T>i—> T>q such that (12. ip is equivalent, by the transformation 
z = h(r,(p), to the system 

= f<p{r,(p) , 

where f r , f<p : T>\ — >• R satisfy 

/ r (r,^) = A + (r-l) + 0((r-l) 2 ), 

U(r,v) = ^ + 0((r-l) 2 ) (3.2) 

as r — > 1, and 

/ r (r,vp) = -A_(r + l) + 0((r + l) 2 ), 

U(r,v) = ^r + 0((r + l) 2 ) (3.3) 

as r —7- —1. Furthermore, fip(r,cp) is positive, bounded away from 0, while f r (r,(p) is 
negative for \r\ < 1 and positive for \r\ > 1. 

Proof: The construction of h proceeds in several steps. We start by defining h in a 
neighbourhood of r = 1, before extending it to all of T>\. 

1. We set h(l,(p) = T + (cp). Hence f(T + (ip)) = d r h(l,(p)f + T' + (ip)ip, so that ip = 1/2+ 
and r = whenever r = 1. 
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2. Let u+(0) be an eigenvector of the monodromy matrix U + (l, 0) with eigenvalue e A+T +. 
Then it is easy to check that 

u+fr) = e- x + T +vu + &,0)u + (0) (3.4) 

is an eigenvector of the monodromy matrix U+(tp + 1, p) with same eigenvalue e A+T +, 
and that ^ 

— u + {<p) = T + [A + (<p)u+(<p) - \ + u + (ip)] . (3.5) 

We now impose that 

h(r, ^ = T + (<p) + (r - l)u + &) + 0((r - l) 2 ) (3.6) 

as r — > 1. This implies that 

f(h(r, ip)) = f(T + (cp)) + (r - l)A + {<p)u + {y) + C((r - l) 2 ) , (3.7) 

which must be equal to 

i = [r' + M + u' + (<p)(r - 1) + 0((r - l) 2 )] ip + [u + (p) + 0((r - l))]r 
= T + [/(T+fc)) + (A^+te) - A+u+C^)) (r - 1) + 0((r - l) 2 )] <p 

+ [u+(p) + 0((r-l))]r. (3.8) 

Comparing with (|3.7p and, in a first step, projecting on a vector normal to u+(y) 
shows that 93 = 1/T+ + C((r - l) 2 ) + C(r(r - 1)). Then, in a second step, projecting 
on a vector perpendicular to f(T + ((p)) shows that r = A+(r — 1) + 0((r — l) 2 ), which 
also implies (p = 1/T+ + 0{(r - l) 2 ). 

3. In order to extend h(r, (p) to all of Pi, we start by constructing a curve segment Ao, 
connecting T + (0) to some point r_ (ip*) on the stable orbit, which is crossed by all 
orbits of the vector field in the same direction (see Figure [1]). Reparametrising T_ if 
necessary, we may assume that ip* = 0. The curve Ao can be chosen to be tangent to 
u + (0) in r + (0), and to the similarly defined vector it-(0) in T_(0). We set 

h(r,<p) = T_((p) + (r + + 0({r + l) 2 ) (3.9) 



as r — > — 1, which implies in particular the relations f|3. 3|) . 

The curve segment Ao can be parametrised by a function r 1— > h(r, 0) which is com- 
patible with (j3.7|) and (|3.9p . that is, 9 r /i(±l,0) = u±(0). We proceed similarly with 
each element of a smooth deformation {A^j^ggi of Ao, where A. v connects r + (<^) to 
r_((/?) and is tangent to u±(ip). The parametrisation r 1— > h(r,ip) of A^ can be chosen 
in such a way that whenever ip < ip', the orbit starting in h(r, ip) E S first hits A^/ at 
a point h(r',ip') with r' < r. This guarantees that 

' = ! f r ' V \ (3.10) 

with / r (r, 93) < for (r, 93) G 5. 

4. We can always assume that f v (r,ip) > 0, replacing, if necessary, ip by ip + S(r) for 

some function 5 vanishing in r = 1. □ 
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Remark 3.2. One can always use ip as new time variable, and rewrite (|3.1|) as the one- 
dimensional, non-autonomous equations 

dr f r (r, p) . f . 

Note, in particular, that 

F(±l, = T±A±(r T 1)) + 0((r T I) 2 ) • (3.12) 
3.2 Stochastic system 

We now turn to the SDE (|2.7p which is equivalent, via the transformation z = h(r, (p) of 
Proposition 13.11 to a system of the form 

dr t = fr(n, tpt] <?) dt + (rg r (r t , ip t ) dW t , ^ 
dp t = U(r t , <Pt\°) dt + crg^rt, tp t ) dW t . 

In fact, Ito's formula shows that / = (f r ,fip) T and g = (gr,5V>) T , where g r and g v are 
(1 x /c)-matrices, satisfy 

g(h(r,tp)) = d r h(r,ip)g r (r,ip) + d (p h(r,tp)g ip (r,(p) 

f(h(r,ip)) =d r h(r,(p)f r (r,<p)+d lp h(r,<p)f v> (r,<p) (3.14) 
+ id 2 [d rr /i(r, ip)g r gj(r, p) + 2d rip h(r, p)g r g^(r, p) + <9 w /i(r, ip)g v g£{r, cp)) . 

The first equation allows to determine g r and g^, by projection on d r h and c^/i. The 
second one shows that 

f r (r, (p; a) = /°(r, 99) + a 2 f^(r, ip) , 

/ v (r, <p; a) = /°(r, + a 2 /^(r, , (3.15) 

where and /° are the functions of Proposition 13.11 

A drawback of the system (|3.13p is that the drift term f r in general no longer vanishes 
in r = ±1. This can be seen as an effect induced by the curvature of the orbit, since 
/f-(±l, (p) depends on F'±(<p) and u'±(p). This problem can, however, be solved by a 
further change of variables. 

Proposition 3.3. There exists a change of variables of the form y = r + 0{a 2 ), leaving 
p unchanged, such that the drift term for dyt vanishes in y = ±1. 

Proof: We shall look for a change of variables of the form 

y = Y(r,<p) = r- f T 2 [A_( v ?)(r-l) + A + (^)(r + l)] , (3.16) 

where A±((p) are periodic functions, representing the shift of variables near the two peri- 
odic orbits. Note that y = 1 for 

r = l + 2a 2 A + (^) + 0(a A ). (3.17) 
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Using Ito's formula, one obtains a drift term for dyt satisfying 

f y (l, p) = f r (l + 2a 2 A + (ip) + 0(a^),p) [1 - a 2 [A„(ip) + A+(ip)}] 

-a 2 [2A' + (ip)f v (l,ip)+0(a 2 )] (3.18) 

- id 4 [4A' + (ip)g r (l, <p)g v (l, p) T + 2A'|(<^(1, <p)g v {\ t <p) T + 0(a 2 )] , 

where the terms 0{a 2 ) depend on A± and Aj_. Using (j3.2|) . we see that, in order that 
fy(l,tp) vanishes, A + (<p) thus has to satisfy an equation of the form 

A+A+M - -^-A' + (ip) + r(tp, A±(ip), A'±(ip)) - (r 2 b(cp, A±(tp))A+((p) = , (3.19) 

where r(cp, A±, A^_) = / 4 (1, 92) + 0(a 2 ). Note that 6(99, A) > is bounded away from zero 
for small a by our ellipticity assumption on g. A similar equation is obtained for A_ (<£>). 
If A = (A + , A_) and S = (A' + , A'_), we arrive at a system of the form 

a 2 B&, A)E' = -DE + AA + R(ip, A, S) , 

A' = S , (3.20) 
<p' = l. 

Here D denotes a diagonal matrix with entries 1 /T + and 1 /T_ , and A denotes a diagonal 
matrix with entries X±. The system (I3.20P is a slow-fast ODE, in which 3 plays the role 
of the fast variable, and (A, ip) are the slow variables. The fast vector field vanishes on a 
uniformly hyperbolic slow manifold of the form H = H*(A, ip), where 

E* ± (A,p) = T ± [X ± A ± (p) + fX±l,p)] +0{a 2 ) . (3.21) 

By Fenichel's theorem |Fen79| . there exists an invariant manifold E = E(p,A) in a a 2 - 
neighbourhood of the slow manifold. The reduced equation on this invariant manifold 
takes the form 

A' ± = T± [A± A ± M + #(±1, ip)] + 0(a 2 ) . (3.22) 

The limiting equation obtained by setting a to zero admits an explicit periodic solution. 
Using standard arguments of regular perturbation theory, one then concludes that the full 
equation (|3,22p also admits a periodic solution. □ 



4 Large deviations 

In this section, we consider the dynamics near the unstable periodic orbit on the level of 
large deviations. We want to estimate the infimum J„ of the rate function for the event 
r(<5) that a sample path, starting at sufficiently small distance 5 from the unstable orbit, 
reaches the unstable orbit at the moment when the angular variable has increased by p. 
Consider first the system linearised around the unstable orbit, given by 

A+T+ D rr (0,ip)\ (r\ 

-A+T+ J\pr) ' 1 • j 



-r) = 

dip \PrJ V 
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(We have redefined r so that the unstable orbit is in r = 0.) Its solution can be written 
in the form 



p r (ip) 



gA+T+o^o) gA+T+o^o) r e- 2 ^^'^ D rr (0,u)d 

Jtpo 








Pr(po) 



(4.2) 

The off-diagonal term of the above fundamental matrix can also be expressed in the form 



where 



h per (<p) 



e 2X+T +v 
1 _ e -2A+T+ 



h^(<p) 



ip+1 



-2X+T+U 



D rr (0, u) du 



(4.3) 



(4.4) 



is the periodic solution of the equation dh/ dtp = 2\ + T + h — D rr (0, <p). The expression fj4.3j) 
shows that for initial conditions satisfying r((fo) = — h pev (ipo)p r (po), the orbit (r((p),p r (ip)) 
will converge to (0,0). The stable manifold of the unstable orbit is thus given by the 
equation r = —hP ev (ip)p r . 

We consider now the following situation: Let (r(0),p r (0)) belong to the stable manifold. 
The orbit starting in this point takes an infinite time to reach the unstable orbit, and gives 
rise to a value Iqo of the rate function. We want to compare this value to the rate function 
J„ of an orbit starting at the same r(0), but reaching r = in finite time (p. 

Recall that the rate function has the expression 



i>jD( ls )^ s ds 



D rr p 2 r + 2D rip p r p v + D^p^ 



dip 



(4.5) 



However, p^ can be expressed in terms of r, <p and p r using the Hamiltonian, and is of 
order r 2 + p 2 . Thus the leading term in the rate function near the unstable orbit is D rr p 2 . 
As a first approximation we may thus consider 



I°{l) = \ J D rr p 2 dp. 



(4.6) 



Proposition 4.1 (Comparison of rate functions in the linear case). Denote by 1^ and 
1^ the minimal value of the rate function 1° for orbits starting in r(0) and reaching the 
unstable orbit in infinite time or in time ip, respectively. We have 



1 J 2 C -2A^, 

2 /iP er (0) 2 



l + 0(e 



-2X + T + ip 



(4.7) 



Proof: Let (r°,p®.)(u) be the orbit with initial condition (r(0),p r (0)), and {r l ,p\.)(u) the 
one with initial condition (r(0),p r (0) + q). Then we have by (|4.2p with po = 0, (|4.3p and 
the relation r(0) = -/i per (0)p r (0) 



r l (u) 
pl(u) 



-X+T+u )-L r (0) + q 

e" A + T +> r (0)+g) . 



X+T+u 



h per (0) 



-X+T+u 



h per (u) 



The requirement r^{(p) = implies 



/jP er (0) 



C>(e 



-2X+T+<p\ 



(4. 



(4.9) 



15 



Since the solutions starting on the stable manifold satisfy p®(u) = e 2X+T+U p r (0), we get 
pl(u) 2 - p° r (u) 2 = 2qp r (u) e ~ x + T + u +q 2 e~ 2X + T + u . (4.10) 
The difference between the two rate functions is thus given by 

r<p poo 

2(iJ-i°) = (2qp r (0)+q 2 ) / e~ 2X + T + u D rr (0,u) du - p r (0) 2 / e~ 2X + T + u D rr (0, u) du . 

J o 

(4.11) 

Using the relations (cf. (|4.3p ) 

f V e -2X+T+u D ^ ^ du = h per^ _ Q -2X + T +V h P«^ j 

JO 

;>oo 

/ e - 2A+T +" At (0, u) du = /i per (0) , 

JO 

poo 

e -2X + T + u D ^ n) dn = e -2A+T+^ ^per^ (412) 



and /i per (0)p r (0) = -r(0) = -6 yields the result. □ 

We can now draw on standard perturbation theory to obtain the following result for 
the nonlinear case. 

Proposition 4.2 (Comparison of rate functions in the nonlinear case). For sufficiently 
small 5, the infimum J„ of the rate function for the event T(5) satisfies 

U-Ioo = \? e~ 2 ^ [i + o(e" 2 ^) + 0(5)] . (4.13) 

Proof: Writing (r,ip,p r ,p v ) = 8(f,tp,p r ,p v ), we can consider the nonlinear terms as a 
perturbation of order 5. Since the solutions we consider decay exponentially, the stable 
manifold theorem and a Gronwall argument allows to bound the effect of nonlinear terms 
by a multiplicative error of the form 1 + 0(5). □ 



5 Continuous-space Markov chains 
5.1 Eigenvalues and eigenfunctions 

Let £cl be an interval, equipped with the Borel cr-algebra. Consider a Markov kernel 

K(x,dy) = k(x,y)dy (5.1) 

with density k with respect to Lebesgue measure. We assume k to be continuous and 
square-integrable. We allow for K(x,E) < 1, that is, the kernel may be substochastic. In 
that case, we add a cemetery state d to E, so that K is stochastic on E U d. Given an 
initial condition Xq, the kernel K generates a Markov chain (Xq, X\, . . . ) via 

F{X n+1 eA}= / F{X n £ dx}K(x,A) . (5.2) 
Je 

We write the natural action of the kernel on bounded measurable functions / as 

(Kf)(x):=E x {f(X 1 )}= [ k(x,y)f(y)dy. (5.3) 

Je 
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For a finite signed measure // with density m, we set 

( j iK){dy):=W{X 1 E dy} = (mK)(y)dy , (5.4) 

where 



(mK)(y) = / m(x) dx k(x, y) . (5-5) 
Je 

We know by the work of Fredholm [Fre03] that the integral equation 

(Kf)(x)-Xf(x)=g(x) (5.6) 

can be solved for any g, if and only if A is not an eigenvalue, i.e., the eigenvalue equation 

(Kh)(x) = Xh(x) (5.7) 

admits no nontrivial solution. All eigenvalues A have finite multiplicity, and the properly 
normalised left and right eigenfunctions /i* and h n form a complete orthonormal basis, 
that is, 

h* n (x)h m {x) dx = 5 nm and ^ h* n (x)h n (y) = 5(x - y) . (5.8) 

n 

Jentzsch |Jenl2j proved that if k is positive, there exists a simple eigenvalue Ao E (0, oo), 
which is strictly larger in module than all other eigenvalues. It is called the principal 
eigenvalue. The associated eigenfunctions ho and are positive. Birkhoff |Bir57] has 
obtained the same result under weaker assumptions on k. We call the probability measure 
7r given by 

h* {x)dx 

dx) = (5.9) 

Je h o(y) d y 

the quasistationary distribution of the Markov chain. It describes the asymptotic distri- 
bution of the process conditioned on having survived. 

Given a Borel set A C E, we introduce the stopping times 

t a = t a {x) = inf {* ^ 1 : X t E A} , 

CTA = aj4 ( x ) = inf{t ^ 0: Z t E A} , (5.10) 

where the optional argument x denotes the initial condition. Observe that ta(x) = cta(x) 
if x E E \ A while cta(x) = < 1 ^ ^(x) if a; E A. The stopping times and a a may 
be infinite because the Markov chain can reach the cemetery state before hitting A (and, 
for the moment, we also don't assume that the chain conditioned to survive is recurrent). 
Given u E C , we define the Laplace transforms 

G A (x)=E*{e^l {TA<oc} } , 

H%(x)=E*{e^l {aA<oo} } . (5.11) 

Note that G\ = H\ in E \ A while H\ = 1 in A. The following result is easy to check by 
splitting the expectation defining G A according to the location of X\\ 



Lemma 5.1. Let 



j(A) = sup K(x,E\A)= sup ¥ x {X ± e E\A} . (5.12) 

x£E\A x£E\A 



17 



Then G A (x) is analytic in u for Ren < log 7(A) , i.e., for |e "| > "f(A), and for these 
u it satisfies the bound 

sup \G%(x)\ s$ 1 —. 1 7-tt . (5.13) 

The main interest of the Laplace transforms lies in the following result, which shows 
that H A is "almost an eigenfunction" , if G U A varies little in A. 

Lemma 5.2. For any u £ C such that G\ and K A exist, 

KH\ = e~ u G u A . (5.14) 
Proof: Splitting according to the location of X\, we get 

{KH\){x) = E-{E^{e^ l W<oo} }} 

= E*{l {Jfl6A} E*{e"^ l {CTA <oo}}} +E x [l {XieEXA} K x ^{e u ^ 1 Wa<oo} }} 
= ^{1^=1}} +E*{e"(-- 1 ) 1 {1<ta<oo} } 

= E-je"^" 1 ) l {rA <oo}} = e~ M G\{x) . (5.15) 

□ 

We have the following relation between an eigenfunction inside and outside A. 

Proposition 5.3. Let h be an eigenfunction of K with eigenvalue A = e~ u . Assume there 
is a set A C E such that 

\e~ u \ > 7(A) = sup F x {X t GE\A} . (5.16) 

xeE\A 

Then 

h(x) = E x {e UTA h(X TA )l {TA<oo} } (5.17) 

for all x 6 E. 

Proof: The eigenvalue equation can be written in the form 

e~ u h(x) = (Kh)(x) =E x {h(X 1 )l {XieA} } +E*{h(X 1 )l {XieEXA} } . (5.18) 

Consider first the case x £ E \ A. Define a linear operator T on the Banach space X of 
continuous functions / : E\A — > C equipped with the supremum norm, by 

(Tf)(x) =E*{e"h(X 1 )l {XieA} } +E*{e«f(X 1 )l {Xl&SKA} } . (5.19) 

Is is straightforward to check that under Condition (15.160 . T is a contraction. Thus it 
admits a unique fixed point in X, which must coincide with h. Furthermore, let h n be & 
sequence of functions in X defined by ho = and h n +\ = Th n for all n. Then one can 
show by induction that 

h n (x) = E x {e UT * h(X TA )l {TA<n} } . (5.20) 

Since limn^?^ h n {x) = h(x) for all x E E\ A, (I5.17P holds for these x. It remains to show 
that ()5.17p also holds for x £ A. This follows by a similar computation as in the proof of 
Lemma 15.21 □ 
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The following result provides a simple way to estimate the principal eigenvalue Aq- 



Proposition 5.4. For any n 1, and any interval A C E with positive Lebesgue measure, 
we have 



inf K n (x, A) 



1/n 



< An < 



sup K n (x, E) 



l/n 



(5.21) 



Proof: Since the principal eigenvalue of K n is equal to Aq , it suffices to prove the relation 
for n = 1. Let x* be the point where ho(x) reaches its supremum. Then the eigenvalue 
equation yields 

A = f k(x*,y)^-dy^K(x*,E), (5.22) 
which proves the upper bound. For the lower bound, we use 



A o / K(y) d v 



h* (x)K(x,A)dx > inf K(x,A) [ h* {y)dy, (5.23) 

xdA J A 

and the integral over A can be divided out since A has positive Lebesgue measure. □ 

The following result allows to bound the spectral gap, between An and the remaining 
eigenvalues. 

Proposition 5.5. Let A be an open subset of E. Assume there exists m : A — )• (0,oo) 
such that 

m(y) ^ k(x,y) sj Lm(y) Vx,y € A (5-24) 
holds with a constant L satisfying XqL > 1. Then any eigenvalue A ^ An of K satisfies 



A| < maxj 27(A) , X L - 1 + Pkm (A) + j(A) 



AnL-1 



1 + 



1 



An - l(A) 



(5.25) 



where 



j(A) = sup A"(x, £ \ A) and Pkm (A) = sup[l - K(x, E)] . (5.26) 



Remark 5.6. Proposition 15.41 shows that An ^ 1 — Pkm(A) — 7(A). Thus, if A is chosen 
in such a way that j(A) and pkiii(A) are small, the bound (15. 25ft reads 

|A| ^ L - 1 + 0{^{A)) +0{ Pkm (A)) . (5.27) 

Proof: The eigenvalue equation for A and orthogonality of the eigenfunctions yield 

Xh(x)= [ k(x,y)h(y)dy , (5.28) 
0- / h* (y)h(y)dy . (5.29) 



For any k > we thus have 

Xh(x) 



[ K K(y) - k ( x ,y)]h(y) &y ■ 



(5.30) 



Let xq be the point in A where \h\ reaches its supremum. Evaluating the last equation in 
xq we obtain 



|A| ^ / \nho(y) - k(x ,y)\dy+ / Kh* (y) + k(x ,y) 
' ■ Je\a l 



l%)l 
\h(x )\ 



dy 



(5.31) 
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We start by estimating the first integral. Since for all y E A, 



V*o(y) = / K(x)k(x,y)dx > m(y) / /io(x)dx, 
Je J A 



(5.32) 



choosing k = XqL(J a hy(x) dx) 1 allows to remove the absolute values so that 

\Kh* (y)-k(x ,y)\dy ^ X L-K(x ,A) ^ X L - 1 + ^(A) + Pkm (A) . (5.33) 



From now on, we assume |A| ^ 2j(A), since otherwise there is nothing to show. In order 
to estimate the second integral in (|5.3ip . we first use Proposition 15.31 with e~ u = A and 
Lemma 15.11 to get for all x E E \ A 



\h(x)\ ^E*{e^^\h(X TA )\l {TA<oo} } ^ 
The second integral is thus bounded by 



1 



E\A 



nho(y) + k(x ,y) 



dy^ 



1 



\Hx )\ 
\X\-j(A) ■ 

K(y) dy 

E\A 



(5.34) 



+ l(A) 



(5.35) 



Now the eigenvalue equation for Aq yields 



f h* (y)dy= [ h* (x)K(x,E\A)dx < j(A) [ h* (x)dx. (5.36) 
Je\a Je Je 



Ao / h 

>E\A 

Hence the second integral can be bounded by 

1 



|A|-704) 
Substituting in (j5.31 j) . we thus get 



|A| < X L- l + j(A) + mn (A) + 



A ° L CT) + ^ 
7(A) 



\X\-j(A) 



1 + 



XnL 



Ao - 7(A) 



(5.37) 



(5.38) 



Now it is easy to check the following fact: Let |A|,a,/3,7 be positive numbers such that 
a, |A| > 7. Then 


|A| < a+ . => |AKa + — —. (5.39) 



|A|-7 



a — 7 



This yields the claimed result. 



□ 



5.2 Harmonic measures 

Consider an SDE in ]R 2 given by 

dx t = f(x t ) dt + ag(x t ) dW t 



(5.40) 



where (Wt)t is a standard /c-dimensional Brownian motion, k ^ 2, on some probability 
space (fi, J 7 , P). We denote by 



2 o 2 2 o2 

i=l *J=i 



(5.41) 
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the infinitesimal generator of the associated diffusion. Given a bounded open set Pel 2 
with Lipschitz boundary dT>, we are interested in properties of the first-exit location 
x T G dV, where 

t = t v = bai{t > 0: x t V} (5.42) 

is the first-exit time from T>. We will assume that / and g are uniformly bounded in T>, 
and that g is uniformly elliptic in D. Dynkin's formula and Riesz's representation theorem 
imply the existence of a harmonic measure H(x,dy), such that 

¥ x {x T eB}= [ H(x,dy) (5.43) 
Jb 

for all Borel sets B C &D. Note that x \-t H(x,dy) is £-harmonic, i.e., it satisfies Ch = 
in T>. The uniform ellipticity assumption implies that for all x G T>, 

H(x,dy) = h{x,y)dy (5.44) 

admits a density h with respect to the arclength (one-dimensional surface measure) dy, 
which is smooth wherever the boundary is smooth. This has been shown, e.g., in [BAKS84J 
(under a weaker hypoellipticity condition). 

We now derive some bounds on the magnitude of oscillations of h, based on Harnack 
inequalities. 

Lemma 5.7. For any setT>o such that its closure satisfies T>q C T>, there exists a constant 
C, independent of a, such that 

sup h(x,y) 

xeVo ^ e°^ 2 (5.45) 



inf h(x,y) 



holds for all y € &D. 



Proof: Let B be a ball of radius R = a 2 contained in V . By |GT0H Corollary 9.25], we 
have for any y £ dT> 

sup h(x,y) ^ Co inf h(x,y) , (5.46) 

where the constant Co ^ 1 depends only on the ellipticity constant of g and on vR 2 , 
where the parameter v is an upper bound on (||/||/o" 2 ) 2 - Since R = a 2 , Co does not 
depend on a. Consider now two points xi,X2 G V. They can be joined by a sequence of 
N = \\\x2 — x\ || /<r 2 ] overlapping balls of radius a 2 . Iterating the bound (|5.46p . we obtain 

h{x 2 ,y) < C$h{xi,y) = ^c )\\\x 2 - Xl \\/^ h ( Xuy ) _ (5.47) 
which implies the result. □ 

Lemma 5.8. Let B r {x) denote the ball of radius r centred in x, and let T>q be such that 
its closure satisfies T>q C T>. Then, for any xq G T>o,y G dT> and 77 > 0, one can find a 
constant r = r(y,rj), independent of a, such that 

sup h(x, y) ^ (1 + 77) inf h(x,y) . (5.48) 
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Proof: Let r be such that B T ^(xq) C V , and write Rq = r a 2 . Since h is harmonic 
and positive, we can apply the Harnack estimate [GTOlj, Corollary 9.24], showing that for 
any R ^ R , 

( R \ a 

osc h := sup h(x,y) — inf h(x, y) ^ C% I — ) osc /i , (5.49) 

where, as in the previous proof, the constants C\ ^ 1 and a > do not depend on a. 
By [GTOll Corollary 9.25], we also have 

sup h(x,y) ^ C*2 inf h(x,y) , (5.50) 

where again C 2 > 1 does not depend on <r. Combining the two estimates, we obtain 
sup h(x, y) osc h a 

xeB R (x ) x£B Ro (x ) 

The result thus follows by taking r = R/a 2 , where R = Ro[r)/(Ci(C 2 - 1))] 1/q - □ 



5.3 Random Poincare maps 

Consider now an SDE of the form (|5.40p . where x = (ip, r) and / and g are periodic in if, 
with period 1. Consider the domain 

V = {(ip,r): - M < if < 1 , a < r < b) , (5.52) 

where a < b, and M is some (large) integer. We have in mind drift terms with a positive 
93-component, so that sample paths are very unlikely to leave T> through the segment 
(p = -M. 

Given an initial condition xq = (0, vq) G T>, we can define 

fc(r ,ri) = ft((0,ro),(l,ri)) , (5.53) 

where h(x, y) Ay is the harmonic measure. Then by periodicity of / and g and the strong 
Markov property, k defines a Markov chain on E = [a, b], keeping track of the value R n of 
rt whenever ipt first reaches n G N . This Markov chain is substochastic because we only 
take into account paths reaching cp = 1 before hitting any other part of the boundary of 
T>. In other words, the Markov chain describes the process killed upon rt reaching a or b 
(or (ft reaching — M). 

We denote by K n (x,y) the ra-step transition kernel, and by k n (x,y) its density. Given 
an interval A C E, we write K^(x, y) = K n (x, y)/K n (x, A) for the n-step transition kernel 
for the Markov chain conditioned to stay in A, and k^(x, y) for the corresponding density. 

Proposition 5.9. Fix an interval Ac E. For x\, x 2 G A define the integer stopping time 

N = N(xi,x 2 ) =inf{n^ 1: \X^ - X^\ < r v a 2 } , (5.54) 

where r v = r{y,rj) is the constant of Lemma \5.& and X^° denotes the Markov chain with 
transition kernel K A (x,y) and initial condition xq. The two Markov chains X* 1 and X^ 2 
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are coupled in the sense that their dynamics is derived from the same realization of the 
Brownian motion, cf. (|5.40p . 
Let 

p n = sup P{N(xi,x 2 ) > n] . (5.55) 

xi,X2&A 

Then for any n ^ 2 and n > 0, 

sup k£(x,y) 

X&A tuA( : < 1 + r? + Pn-i e^ 2 (5.56) 
mf fc n (x, y) 

ZioWs /or all y £ A, where C does not depend on a. 
Proof: We decompose 

n-1 

F{X^ G dy} = ^2F{X^ e dy\ N = k}¥{N = k) 

k=l 

+ ¥{X^ Edy\N>n-l}F{N >n-l} . (5.57) 

(2) 

Let kn ((a^i, X2), (zi, z 2 ) \ N = k) denote the conditional joint density for a transition for 
(X® 1 , Xf 2 ) in n steps from (xi,x 2 ) to (zi,z 2 ), given N = k. Note that this density is 
concentrated on the set {\z 2 — zi\ < r v a 2 }. For k = 1, ... , n — 1 and any measurable 
-B C E, we use Lemma 15.81 to estimate 



G B | N = k) 

= [ [ F{X^ k £B}k£\(x 1 ,x 2 ),(z 1 ,z 2 )\N = k)dz 2 dz 1 
J A J A 

+ / P{X^ fc GB}^((x 1 ,x 2 ),(z 1 ,z 2 )|iV = A : )dz 2 dz 1 



A J A 



= (1 + r/)P{X* 2 G S | JV = k} . (5.58) 

Writing k n -i(xi, z\ \ N > n—1) for the conditional (n — l)-step transition density of Xf 1 , 
the last term in (]5.57p can be bounded by 

¥{X^ eB\N>n-l}= [ ¥{X^ G B}k n _ x {x 1 , z x \ N > n - 1) dz ± 

J A 

^ sup P{Xf G B^X* 1 ^ G E | iV > n - l} 

2lGA 

< sup ¥{X^ G 5} . (5.59) 

zjeA 



We thus have 



'{X* 1 G dy} < (1 + r?)P{^' 2 G dy} + p n _i sup P{Xf G dy} . (5.60) 

ziGA 



On the other hand, we have 



¥{X^ G dy} > P{X%_ 1 G A} inf P{Xf G dy} . (5.61) 

Combining the upper and lower bound, we get 

sup kn(x,y) supk A (z,y) 

^—j < 1 + V + Pn-i Z -^— A • (5.62) 

inf ' k n (x, y) mf \k (z,y) 
xeA zeA 

Hence the result follows from Lemma 15.71 □ 
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6 Sample-path estimates 



6.1 The principal eigenvalue A" 

We consider in this section the system 



dr t = [X+r t + b r (r t , (ft)] dt + ag r (r t , ip t ) dW t , 

— + b ip (r t ,<p t ) d.t + ag^(r t ,ipt)dW t , 
LI4. j 



dip 



(6.1) 



describing the dynamics near the unstable orbit. We have redefined r in such a way that 
the unstable orbit is located in r = 0, and that the stable orbit lies in the region {r > 0}. 
Here {Wt}j is a fc-dimensional standard Brownian motion, k ^ 2, and g = (gj,g^) 
satisfies a uniform ellipticity condition. The functions b r , b v , g r and g v are periodic in p 
with period 1 and the nonlinear drift terms satisfy | b r (r, ip) \ , \b v (r, ip) | ^ Mr 2 . 

Note that in first approximation, ipt is close to t/T + . Therefore we start by considering 
the linear process rf defined by 

dr° = Ar t ° dt + ag (t) dW t , (6.2) 

where go(t) = g r (0,t/T + ), and A will be chosen close to A + . 

Proposition 6.1 (Linear system). Choose a T > and fix a small constant 5 > 0. Given 
ro G (0,5) and an interval A C (0,(5), define 

P(r ,A) = P r °{0 <r° t < <5Vt € [0,T],r T G A} , 

and let 



I 

Jo 



-2Xs 



g (s)g (s) T ds forte[0,T] 



(6.3) 
(6.4) 



1. Upper bound: For any T > 0, 

1 5 2 r 



P(r Q ,(0,8)) ^ 



2vr a 3 3/2 



e -r2/2a 2 i; T g -2AT 



r e 



-2AT 



4„2 



(6.5) 



Lower bound: Assume A = [a a, ab\ for two constants < a < b. Then there exist 
constants Co, C\, c > 0, depending only on a, 6, A, smc/i i/tai /or am/ ro G i and T ^ 1, 

-c5 2 /a 2 ■ 



P(r ,A) > Cq-CxT 



S 2 



-2 AT 



(6.6) 



Proof: We shall work with the rescaled process zt = e A *r°, which satisfies 

dz t = e- xt ag (t)dW t . (6.7) 
Note that z t is Gaussian with variance a 2 Vf. Using Andre's reflection principle, we get 
P(r ,(0,<5)) ^ F r °{z t >0Vt£ [0,T],0 < z T < 5e' XT } 

= P ro {0 < z T < 5e~ XT } - ¥- ro {0 <z T < 5e~ XT } 



1 f Se ~ 

TTCr 2 VT Jo 



\j2na 2 VT Jo 
2 



-(r ~z) 2 /2cr 2 v T _ -(r +z) 2 /2a 2 v T 



dz 



y / 2ira 2 VT 



e -r 2 /2a 2 v T 



sinh 



\a z vr J 



dz 



(6.8) 
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and the upper bound (|6.5|) follows by using cosh(u) — 1 = \u 2 + 0(u A ). 

To prove the lower bound, we introduce the notations tq and ts for the first-hitting 
times of rt of and 5. Then we can write 

P(r ,A) = P r °{r >T,r T e A} -¥ ro {rs < T < r , r T 6 A) . (6.9) 

The first term on the right-hand side can be bounded below by a similar computation as 
for the upper bound. Using that r$ is of order o~, that vt has order 1 for T ^ 1, and taking 
into account the different domain of integration, one obtains a lower bound Cq&~ 2XT . As 
for the second term on the right-hand side, it can be rewritten as 

<TAr }lF" 5 {' r o > T - t s , r T - TS 6 A} | . (6.10) 

By the upper bound (I6.5j) . the probability inside the expectation is bounded by a constant 
times e - 2 M T ~ r s) e ~c6 /a jgz rema i ns t estimate E r ° {^{ ts <tat } e 2Xrs }• Integration 
by parts and another application of (16.5P show that this term is bounded by a constant 
times T, and the lower bound is proved. □ 



Remark 6.2. 

1. The upper bound (16. 5j) is maximal for tq = a^/vr, with a value of order (5 2 /<j 2 vt) e~ 2XT . 

2. Applying the reflection principle at a level —a instead of 0, one obtains 

P r »{-ae Ai < rf < SVt G [0,T]} ^ Cp ^T ( 6 - n ) 

k J (T z Vt 

for some constant Co (provided the higher-order error terms are small). 

We will now extend these estimates to the general nonlinear system (|6.ip . We first 
show that ipt does not differ much from t/T + on rather long timescales. To ease the 
notation, given h,h\ > ,we introduce two stopping times 



T h = inf {t >0:r t ^h}, 

t 



r 9 = inf <; t > 0: 



ft 



^ Mitft + hA \ . (6.12) 



Proposition 6.3 (Control of the diffusion along <p). There is a constant C\, depending 
only on the ellipticity constants of the diffusion terms, such that 

P (ro ' 0) K < A T} < e -hV(C lh W) (6 13) 

/ioWs /or a// T, <r > and all h, hi > 0. 
Proof: Just note that rjt = (ft — t/T+ is given by 



Tt = / b ip {r s ,(p s )ds + a I g v (r s ,tp s )dWs . (6.14) 

For t < r^, the first term is bounded by Mh 2 t, while the probability that the second one 
becomes large can be bounded by the Bernstein-type estimate Lemma lA.ll □ 
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In the following, we will set h% = V h 3 T. In that case, h 2 t + hi < h(l + 2hT), and 
the right-hand side of (|6.13p is bounded by e~ h ^ ClCT ). All results below hold for all a 
sufficiently small, as indicated by the cr-dependent error terms. 

Proposition 6.4 (Upper bound on the probability to stay near the unstable orbit). Let 
h = a 1 for some 7 G (1/2, 1), and let fi > satisfy (1 + 2/i)/(2 + 2/x) > 7. Then for any 
< r < h and all <T ^ l/h, 

r°{0 < r t < h Vt G [0,T A r,]} < exp{-A + T " °(|^)] } ■ ( 6 - 15 ) 

Proof: The proof is very close in spirit to the proof of [BG06, Theorem 3.2.2], so that 
we will only give the main ideas. The principal difference is that we are interested in the 
exit from an asymmetric interval (0, h), which yields an exponent close to 2A instead of 
A for a symmetric interval (—h,h). To ease the notation, we will write A instead of A+ 
throughout the proof. 

We introduce a partition of [0, T] into intervals of equal length A/A, for a A to be 
chosen below. Then the Markov property implies that the probability (|6.15|) is bounded 
by 

/*Y-1 / x,r>g(g(A) 1 )\ (r -1 r>\ 

q(A) expi -XT i , (6.16) 

where q(A) is an upper bound on the probability to leave (0, h) on a time interval of length 
A/A. We thus want to show that log(g(A) _1 )/A is close to 2 for a suitable choice of A. 
We write the equation for rt in the form 

dr t = [Xr t + b(r t ,<pt)] dt + ag {t)dW t + ag 1 (r t ,(pt,t)dW t ■ (6.17) 

Note that for \r%\ < h and t < t v A T, we may assume that gi(rt, (ft,t) has order h + h?T, 
which has in fact order h since we assume T ^ l/h. Introduce the Gaussian processes 



rt = r e x±t +ae x±t f e~ x±s g (s) dW s , (6.18) 
J 

where A ± = A ± Mh. Applying the comparison principle to rt — rf, we have 

^ + a e A ~* M^T < r t < r+ + a e A+t M\ (6.19) 
as long as < rt ^ h, where M t are the martingales 

Mf = I e- x±s g 1 (r s ,< Ps ,s)dW s . (6.20) 
J 

We also have the relation 



e 2Mht 



rt+ae x+t M° t where M° t = [ [e x+s - e A s ]g (s) dW s . (6.21) 

Jo 



Using Ito's isometry, one obtains that J^q. has a variance of order h . This, as well as 

if 



Lemma lA. II in the case of Mf , shows that 



jupl.e-A.ri > H) « ^{- 2C J^, \ (6.22) 
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for some constant C\. Combining (|6.19|) and (|6.21|) . we obtain that < r t < h implies 



a e 



x+t Mt < r+ < e 2Mht [h + a e A ~* Mj] - a e x+t M° t . (6.23) 



The probability we are looking for is thus bounded by 

q(A) =F{-H< rt < e 2Mht [h + H] + H Vt G [0, A/A]} + 3 exp j - ^^aTaa } ' 

(6.24) 

The first term on the right-hand side can be bounded using (I6.1ip with a = H, yielding 



( U \ {e 2MhA/X ^ + ^ + F) -A +H 1 2 r " 2 



We now make the choices 



-A i_ ; x 1 + M,„A , .. , h 2 \ 



H = e~*h and A = log ^1 + fi + J . (6.26) 

Substituting in (|6,25p and carrying out computations similar to those in [BG06, Theo- 
rem 3.2.2] yields log(<?(A) _1 )/A > 2/i/(l + /i) - 0(l/|loger|), and hence the result. □ 

The estimate (|6.15p can be extended to the exit from a neighbourhood of order 1 of 
the unstable orbit, using exactly the same method as in [BGK12, Section D]: 

Proposition 6.5. Fix a small constant S > 0. Then for any k < 2, there exist constants 
o~o, a, C > and < v < 2 such that 

P r °{0 < r t < 5 Vt G [0,T]} < ^e- KA+T (6.27) 

/joWs /or a// ro G (0, J), a// a < o~o and a// T ^ cr _ly . 

Proof: The proof follows along the lines of [BGK121 Sections D.2 and D.3]. The idea is 
to show that once sample paths have reached the level h = a" 1 , they are likely to reach 
level 5 after a relatively short time, without returning below the level h/2. To control 
the effect of paths which switch once or several times between the levels h and h/2 before 
leaving (0,5), one uses Laplace transforms. 

Let 7~i denote the first-exit time of rt from (0, h), where we set t\ = T if rt remains in 
(0, h) up to time T. Combining Proposition 16.31 with h\ = V h 3 t and Proposition 16.41 we 
obtain 

P ro {n > t } < „ \ , e - KlA+t + e- 1 /^ 1 ^ 7 ) Vt G [0, T] , (6.28) 

where K\ = 2/i/(l + jj) — 0(l/|log<r|). The first term dominates the second one as long as 
v < 2 — 7. Thus the Laplace transform ~E{e u ( TlAT ^} exists for all u < 1/(kiA + ). 

Let T2 denote the first-exit time of n from (h/2, 5). As in [BGK12( Proposition D.4], 
using the fact that the drift term is bounded below by a constant times r, that {T2 > t} C 
{r t < 5}, an endpoint estimate and the Markov property to restart the process at times 
which are multiples of | log cr | , we obtain 



^{r 2 >t}<e Xp {-C 2 t72(1 _ T ; |ioga| } WG[0,T] 



(6.29) 
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for some constant C%. Therefore the Laplace transform K{e u ( T2AT ^} exists for all u of 
order 1/(<t 2 ( 1-7 ) |log o~|). In addition, one can show that the probability that sample paths 
starting at level h reach h/2 before 5 is exponentially small in 1/cr 2 '- 1 " 7 ^ . 

We can now use [BGK12J Lemma D.5] to estimate the Laplace transform of r = 
tq A ts A T, and thus the decay of P{t > t} via the Markov inequality. Given k = 2 — e, 
we first choose fi and o~o such that k\ ^ 2 — e/2. This allows to estimate E{e UT } for 
u = K\ — e/2 to get the desired decay, and determines a. The choice of \x also determines 
7 and thus □ 

Proposition 6.6 (Lower bound on the probability to stay near the unstable orbit). Let 
h = cr 7 for some 7 G (1/2, 1), and lei ^4 = [<ra, <r&] for constants < a < b. Then there 
exists a constant C such that 



{0 < r t < hVt G [0,T] , r T G A} ^ Cexp|-2A+T 1 + } 



(6.30) 



/ioWs /or all r^ £ A and allT ^ 1/h. 



Proof: Consider again a partition of [0, T] into intervals of length A/A+, and let o(A) 
be a lower bound on 

P ro {0 < r t < h Vi G [0, A/A+] , r A/A+ G ^} (6.31) 

valid for all ro G A. By comparing, as in the proof of Proposition 16.41 rt with solutions of 
linear equations, and using the lower bound of Proposition 16. 1} we obtain 

g(A) = d e- 2A -C 2 e~ c ^ 2 e ^ (6.32) 

for constants C%, C2, c > 0, where the second term bounds the probability that the martin- 
gales aAd^ exceed H = e _A h times an exponentially decreasing curve. By the Markov 
property, we can bound the probability we are interested in below by the expression (|6.16p . 
The result follows by choosing A = co|logcr| for a constant cq. □ 

We can now use the last two bounds to estimate the principal eigenvalue of the Markov 
chain on E = [0,25] with kernel K u , describing the process killed upon hitting either the 
unstable orbit at r = or level r = 25. 

Theorem 6.7 (Bounds on the principal eigenvalue AJJ). For any sufficiently small 5 > 0, 
there exist constants o"o,c > such that 

(1 - c5 2 ) e ~ 2A + T + < A# < (1 + c5 2 ) e~ 2X+T + (6.33) 

holds for all o < o~q. 

Proof: We will apply Proposition 15.41 In order to do so, we pick n G N such that 

T = TTTTptZ < 6 ' 34 > 



satisfies Proposition 16.51 and is of order a v with v < 2. Proposition 16.31 shows that with 
probability larger than 1 — e~ 



-5/(Cl<T 2 -T1 



+ M5 2 T for all t < T A r h (6.35) 
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for h = a 7 as before, with 7 > v. In particular, we have ipx ^ n. Together with 
Propositions 16.4 1 and 16.51 applied for n = 2 — 5 2 , this shows that for any rp € E 



K (r ,E) < ^exp{-(2-^) A+ _^±_ |> + e -W0 



1 + M<5 2 T + 
2/i 



(9 



(6.36) 



1 + H \| logo- 
Using log (a + b + c) ^ log 3 + maxjlog a, log b, log c} and the fact that v < 2, we obtain 

ilogi^(r ,£) 
n 

Since n has order o~ y , we can make a small enough for all error terms to be of order 5 2 . 
Choosing first /i, then the other parameters, proves the upper bound. The proof of the 
lower bound is similar. It is based on Proposition 16.61 a basic comparison between tt and 
the value of rt at the time t when <pt reaches n, and the lower bound in Proposition E3J D 

6.2 The principal eigenvalue Aq and the spectral gap 

We consider in this section the system 

dr t = [-X-r t + b r (r t , ip t )] dt + ag r (r u tp t ) dW t , 
dipt 



+ b^n, (ft) dt + ag 9 (r t , (ft) dW t , (6.38) 



describing the dynamics away from the unstable orbit. We have redefined r in such a way 
that the stable orbit is now located in r = 0, and that the unstable orbit is located in 
r = 1. 

In what follows we consider the Markov chain of the process killed upon reaching level 
1 — 5 where 5 G (1/2,1), whose kernel we denote K s . The corresponding state space is 
given by E = [-L, 1 — 5] for some L 1. 

Proposition 6.8 (Lower bound on the principal eigenvalue Ag). There exists a constant 
k > such that 

Aq ^ 1 — e- K/a2 . (6.39) 

Proof: Let A = [— h, h] for some h > 0. If h is sufficiently small, the stability of the 
periodic orbit in r = implies that any deterministic solution starting in (tq , 0) with 
ro E A satisfies |ri| ^ hi < h when it reaches the line ip = 1 at a point (n,l). In 
fact, by slightly enlarging hi we can ensure that |r^| ^ h\ < h whenever ipt is in a small 
neighbourhood of 1. Using, for instance, [BG061 Theorem 5.1.18]Jf] one obtains that the 
random sample path with initial condition (t*o,0) stays, on timescales of order 1, in a ball 
around the deterministic solution with high probability. The probability of leaving the 
ball is exponentially small in 1/a 2 . This shows that K{x,A) is exponentially close to 1 
for all x G A and proves the result, thanks to Proposition 15.41 □ 



3 This might seem like slight overkill, but it works. 
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Note that in the preceding proof we showed that for A = [—h, h], there exist constants 
C\ , k > such that 

sup K(x, E\A)+ p m (A) ^ d e~ K/a2 . (6.40) 
xeA 

The following proposition gives a similar estimate allowing for initial conditions x G E. 

Proposition 6.9 (Bound on the "contraction constant"). Let A = [—h, h\. For any h > 0, 

there exist n\ S N and constants C\,k > such that 

7 ni (A) := sup K ni (x, E\A)^d e~ K ^ 2 . (6.41) 

x&E 

Proof: Consider a deterministic solution xf ct = (rf et ,ipf et ) with initial condition xo = 
(ro,0). The stability of the orbit in r = implies that xf et will reach a neighbourhood of 
size h/2 of this orbit in a time T of order 1. By [BG06, Theorem 5.1.18], we have for all 
t ^ 

P x °( sup \\x t - xf ct \\ > ho \ ^ C (l + t)e- Koh °/ a2 (6.42) 

for some constants Cq,kq > 0. The estimate holds for all ho ^ hi/x(t), where h\ is 
another constant, and \(t) is related to the local Lyapunov exponent of xf et . Though 
x(t) may grow exponentially at first, it will ultimately (that is after a time of order 1) 
grow at most linearly in time, because xf et is attracted by the stable orbit. Thus we have 
x(T) ^ 1 + CT for some constant C. Applying ()6.42p with ho = h/2, we find that any 
sample path which is not killed before time n\ close to T will hit A with a high probability, 
which yields the result. □ 

Let X® 1 and X^ 2 denote the values of the first component r% of the solution of the 
SDE (|6.38p with initial condition x\ or X2, respectively, at the random time at which (ft 
first reaches the value 1. Note that both processes are driven by the same realization of 
the Brownian motion. 

Proposition 6.10 (Bound on the difference of two orbits). There exist constants ho, c > 
and p < 1 such that 

P{|Xf - X^\ > p\x 2 - xi|} < e"^ 2 (6.43) 
holds for all x\, X2 £ A = [— ho, ho]. 

Proof: Let (£t,Vt) denote the difference of the two sample paths started in (xi,0) and 
(x2,0), respectively. It satisfies a system of the form 

d& = -A_6 dt + m) dt + o-gxitum) dW t , 

drjt = 6 2 (6, Vt) dt + ag 2 (Ct,r]t) dW t , (6.44) 

with initial condition (£o, 0), where we may assume that £o = x 2 — x\ > 0. Here \bi(£, r/)| ^ 
M(£ 2 + rj 1 ) and \gi{^,i])\ ^ ^(|C| + \v\) f° r i = 1, 2 (remember that both solutions are 
driven by the same Brownian motion). Consider the stopping times 

r e = inf{i > 0: & > H e" A -*} , 

tv, = inf{t > 0: > h} , (6.45) 
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where we set H = £g + h = Lh. Writing r/ t in integral form and using Lemma IA.ll we 
obtain 

< r ? A 2} sC e- Cl/a2 (6.46) 

for some constant c\ > 0, provided h is smaller than some constant depending only on M 
and L. In a similar way, one obtains 

P{r e < t v A 2} < e- C2,a2 (6.47) 

for some constant c 2 > 0, provided H is smaller than some constant depending only on M. 
It follows that 

P{r ? AT,,a}<e" c/ff2 (6.48) 

for a c > 0. Together with the control on the diffusion along ip (cf. Proposition I6.3[) . we 
can thus guarantee that both sample paths have crossed ip = 1 before time 2, at a distance 

|Xf - Xf| < e- A -/ 2 (£ + h) (6.49) 

with probability exponentially close to 1. For any p E (e~ A ~/ 2 , 1), we can find h such that 
the right-hand side is smaller than p£o = p\ x 2 — x i\- This yields the result. □ 

From (|6.43p . we immediately get 

F{\X^ -X?\ > p n \x 2 -xi|} < ne- c/CT2 Vn > 1 . (6.50) 

Fix a > and let TV" be the integer stopping time 

N = N( Xl ,x 2 ) = inf{n > 1: |X* 2 - < aa 2 } . (6.51) 

If no is such that p n °diam(A) ^ a<r 2 , then (|6.50|) implies F{N > no} ^ noe _C// °" 2 whenever 
xi,X2 G -A. Using Proposition 16.91 and the Markov property, we obtain the following 
improvement. 

Proposition 6.11 (Bound on the hitting time of a small ball). There is a constant C 2 
such that for any k ^ 1 and all x±,x 2 G A, we have 

F{N(xi,x 2 ) > kn | X?\X* 2 G AVI < A;n } < (C 2 |log(afj 2 )| e - K2/,j2 ) fc , (6.52) 
where k 2 = c. 

Proof: By the definition of no and (|6.40p . for any x%,x 2 G A we have 

F{N > n } 



"{AT > n I Xf\Xf 2 g AV/ < n } 



P{Xf\Xf 2 eiVK n } 



< -7-2 < 2n o e 

1 - n Ci e~^ /CT 

Thus the result follows by applying the Markov property at times which are multiples of 
no, and recalling that uq has order | log (acr 2 ) |. □ 

Combining the last estimates with Proposition 15.51 we finally obtain the following 
result. 
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Theorem 6.12 (Spectral gap estimate for K s ). There exists a constant c > such that 
for sufficiently small a, the first eigenvalue of K s satisfies 



| A J| ^ e -c/|log<r| ( 6>53 ) 

Proof: We take n = k(no + n\), where k will be chosen below. Fix h > and set 
A = (—h,h). We apply Proposition 15.51 for the Markov chain K^, conditioned on not 
leaving A, with m(y) = mi x ^A k n (x, y), which yields 



f (\ s ) n T 
|Ai|» < max|2 7 «(^) , (A S )"L - 1 + 7 "(A) ^L_ 



1 

1 + 



(A s )" - T(A) 



(6.54) 



Proposition 16.91 shows that "f n (A) ^ 7 n ([— h/2, h/2]) is exponentially small, since n ^ n\. 
Proposition 16.81 shows that (Ag) n is bounded below by 1 — ne _K//<T . It thus remains to 
estimate L. Proposition 15.91 shows that 

T . 1 + V + sup gl ,x 2 gA F{iV(x 1 , x 2 ) > n - 1} e c ^ 2 

L ^ ~c T7~r T\ i (6.55) 

mi x( z A K n {x,A) 

where the parameter a in the definition of the stopping time N is determined by the choice 
of rj. We thus fix, say, rj = 1/4, and k = \C / n<z \ + 1. In this way, the numerator in (I6.55P 
is exponentially close to 1 + rj. Since n has order | log <r | , the denominator K n _i(x, A) is 
still exponentially close to 1, by the same argument as in Proposition 16.81 Making a small 
enough, we can guarantee that L — 1 ^ 3/8 and (AQ) n L > 1, and thus |Ai| n ^ 1/2. The 
result thus follows from the fact that n has order | log <r | . □ 



7 Distribution of exit locations 

We can now complete the proof of Theorem 12.3^ which is close in spirit to the proof 
of [BG041 Theorem 2.3]. We fix an initial condition (ro,0) close to the stable periodic 
orbit and a small positive constant A. Let 

P A (p) =F ro >°{ip T _ e + A]} (7.1) 

be the probability that the first hitting of level 1 — 5 occurs in the interval [<pi,<p\ + A]. 
If we write ip\ = k + s with k G N and s G [0, 1), we have by the same argument as the 
one given in (|2,25p . 

P A (k + s) = C(r )(\ s ) k r«{<p T _ e[s,s + A]} 

where C(ro) is a normalising constant, ttq is the quasistationary distribution for K s , and 

P*Hpt- e [s,s + A]} = J Tr s (r)P r {<p T _ G [s, s + A] } dr . (7.3) 

Note that 7Tq is concentrated near the stable periodic orbit. By the large deviation principle 
and our assumption on the uniqueness of the minimal path 700, ¥ r {ip T _ G [s, s + A]} is 
maximal at the point s* where 7oo crosses the level 1 — 5, and decays exponentially fast in 
l/<7 2 away from s*. In addition, our spectral gap estimate Theorem 16.121 shows that the 



l + O 



(7.2) 
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error term in (|7.2[) has order 5 as soon as k has order | log cr| | log <5| . For these k we thus 
have 

P A (k + s) = d(A s ) fc e~ J ^ 2 [1 + 0(5)] , (7.4) 

where J(s) is periodic with a unique minimum per period in s* . The minimal value J(s*) 
is close to the value of the rate function of the optimal path up to level 1 — 5. 
Now let us fix an initial condition (1 — S, (fi), and consider the probability 

pi-^ij^ G [<p 2 ,<p 2 + A]} 



(7.5) 



of first reaching the unstable orbit during the interval [ip 2 , <f>2 + A] when starting at level 
1 — 5. By the same argument as above, 



Qa(vi,* + s) = C 2 (AS) £ P 7r o{^ 6 [s, s + A]} 



1 + 



(7.6) 



On the other hand, by the large-deviation principle (cf. Proposition 14. 2p . we have 



Qa(<Pi,£ + s) = A?(s)exp 
where 



(7- 



C ( s ) = 5 2 e- 2A + T ^ hPeT{s) 



^ per (^i) 2 ' 



+ 0(5)] , (7.7) 



(7.8) 



and the cr-dependent prefactor satisfies lim a ^o a 2 log Di(s) = 0. In particular we have 

- hm a 2 log Q A (<pi,£ + s) = 1^ + ^ e" 2 ^ T + [l + 0(e~ m + T +) + 0(6)] . (7.9) 

This implies in particular that the kernel K u has a spectral gap. Indeed, assume by 
contradiction that lim (T _>o(Ai Mo) = Using this in (|7,6p . we obtain that a 2 log Qa^i, £+ 
s) converges to a quantity independent of £, which contradicts (|7.9p . We thus conclude 
that lim (T _ i .o(Ai /Aq) = p < 1. In fact, (|T.9[) suggests that p ~ e~ 2A+T +, but we will not 
attempt to prove this here. The existence of a spectral gap shows that for all £ 3> | log <5 1 , 
we have 

D e (s) = (\» ) t D*(s)[l + O(6)] , (7.10) 

where D*(s) = lim£_ >00 (Ao) _£ D^(s). Periodicity implies that D^{s + 1) = £>£ +1 (s), so that 
necessarily 

D*(s + 1) = e" 2A + T + D*(s) [1 + 0{5)] . (7.11) 

Let us now define 



6{s) = -hogc(s) = \ + T + s-Uot 



5 2 h^(s) 
/iP er (v?i) s 



(7.12) 



Note that 6(s + 1) = 6(s) + A+T+. Since a~ 



,2|loga| 



, we can rewrite (|7.7|) in the form 



Q A (^^ + s) =jD ,( s)e -W- 2 exp |_l e -2[^A + T + -|lo g( x|] [l + ( e - 2 ^+) + 0(5)]| . 

(7-13) 

In a similar way, the prefactor Di(s) can be rewritten, for I 3> | log <5| , as 



D e (s) = a 2 e 29{s) D*(s)exp\ -2 \o(a) + £X+T + [1 + 0(5)\ - |loga| 



(7.14) 
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Introducing the notation 

A(x) = Iexp{-2x-ie- 2x ] , (7.15) 

we can thus write 

Q A {<Pi,£ + s) = 2a 2 e- I °°l u ' 2 e 29 ^ D*(s)A(6(s) + IX+T+ - |log a\ + 0{5l)\ . (7.16) 

Let us finally consider the probability 

P ro '°{^r G [n + s,n + s + A}} . (7.17) 

As in |BG04I Section 5], it can be written as an integral over tpi, which can be approximated 
by the sum 

n-l 

S = J2 p A(n-i + s*)Q A (s*,£ + s) . (7.18) 
e=i 

Note that since K n is defined by killing the process when it reaches a distance 25 from 
the unstable periodic orbit, we have to show that the contribution of paths switching back 
and forth several times between distance 5 and 25 is negligible (cf. [BG04, Section 4.3]). 
This is the case here as well, in fact we have used the same argument in the proof of 
Proposition 16.51 From here on, we can proceed as in [BG04j Section 5.2], to obtain 

S = da 2 e ~( / -+ J ( s *))/' j2 (A s ) n e 29(s) D*(s)S 1 , (7.19) 

where 

oo 

Sx= ^W)- i A(e(s) + £\+T + -\\oga\+0(5j)[l + 0(5\\og5\)]. (7.20) 

£=-oo 

The main point is to note that only indices t in a window of order |log 5\ contribute to the 
sum, and that A(x + e) = A(x)[l + 0(e)] for x ^ 0. Also, since Ag is exponentially close 
to 1, the factor (Ag) - ^ can be replaced by 1, with an error which is negligible compared to 
<5|log 5\. Extending the bounds to ±oo only generates a small error. Now (|7.19|) and (|7.2U[) 
yield the main result, recalling that xa(«) = e 29 ^ D*(s) is periodic in s. 

A A Bernstein— type estimate 

Lemma A.l ([RW00, Thm. 37.8]). Consider the martingale 

M t = f g(X s ,s)dW s , (A.l) 
Jo 

where X s is adapted to the filtration generated by W s . Assume 

g(X s ,s)g(X s ,sf ^G(s) 2 (A.2) 

almost surely and that 

V(t) = [ G(s) 2 ds < oo . (A.3) 
Jo 

Then 

P{ sup M S >L) ^ e' L2 / 2V ^ (A.4) 

holds for all L > 0. 
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